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ABSTRACT 


This paper proposes a two-stage instrumental variable quantile 
regression (2S-IVQR) estimation to estimate the time-invariant 
effects in panel data model. In the first stage, we introduce the 
dummy variables to represent the time-invariant effects, and use 
quantile regression to estimate effects of individual covariates. The 
advantage of the first stage is that it can reduce calculations and 
the number of estimation parameters. Then in the second stage, we 
adapt instrument variables approach and 2SLS method. In 
addition, we present a proof of 2S-IVQR estimator's large sample 
properties. Monte Carlo simulation study shows that with 
increasing sample size, the Bias and RMSE of our estimator are 
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decreased. Besides, our estimator has lower Bias and RMSE than 


those of the other two estimators. 
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1. INTRODUCTION 

Panel data not only reflects the individual 


heterogeneity of cross-section data, but also shows 
the dynamic information about the time series data. 
Quantile regression can describe the independent 
variable X for the dependent variable Y range 
accurately, and capture systematic influences of 
covariates on the location, scale and shape of the 
conditional distribution of the — response. 
Additionally, there is no need for quantile 
regression model to make any assumptions on the 
overall distribution, compared with the ordinary 
least squares method. 


Quantile regression model for panel data, can fully 
describe the conditional distribution of the response 
variable as well as control the variability. There are 
three types of estimation of quantile regression for 
panel data with fixed effects: the penalty 
estimation, two-step estimation, minimum distance 
estimation. 


The penalty estimation mainly considers adding a 
penalty item in_ the objective function. 
Koenker(2004) proposed a general approach to 
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Quantile 


estimate quantile regression models for longitudinal 
data employing 1, regularization methods. 
Lamarche(2010) proved, in theory, that there 
existing optimal penalty parameter of penalized 
quantile regression for panel data with fixed effect. 
Galvao and  Montes-Rojas(2010) proposed 
penalized quantile regression for dynamic panel 
data, using instrumental variables to resolve the 
problem of endogeneity. Galvao(2011) also adopted 
instrumental variables approach to study a quantile 
regression dynamic panel model with fixed effects. 
Harding and Lamarche(2014) gave an (1 penalized 
quantile regression estimator which adapted the 
Hausman—Taylor instrumental variable approach. 
Lamarche(2014) proposed a penalized quantile 
regression estimator for panel data that explicitly 
considers individual heterogeneity associated with 
the covariates. Tao, Zhang and _ Tian(2017) 
proposed two new panel data instrumental variable 
estimators that combine the Huasman-Taylor 
instrumental variables of |Huasman and 
Taylor(1981) and 1, shrinkage of Koenker(2004) to 
resolve biased parameter estimation problem 
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caused by lagged response variables and random 
error. 


The main idea of the two-step estimation is to 
eliminate the fixed effect in the first step, use 
simple quantile regression for transformed data in 
the second step. Canay(2011) introduced a two-step 
estimator for panel data quantile regression models. 
It is noted that the two-step estimation method 
eliminates the fixed effect in the first step, which 
can greatly reduce the estimated parameters in 
quantile regression. Notethatdue to the first step, 
the two-step estimation cannot adapt the time- 
invariant independent variables in the panel data 
model. 


Galvao and Wang (2015) developed a new 
minimum distance quantile regression (MD-QR) 
estimator for panel data models with fixed effects. 
The MD-QR is defined as the weighted average of 
the individual quantile regression slope estimators. 
However, from the definition of MD-QR we can 
know, the model cannot adapt the time-invariant 
independent variables. Besides, MD-QR estimator 
ignores the endogenous problems. Galvao, Gu and 
Volgushev(2018) provided new insights on the 
asymptotic properties of the MD-QR estimator 
under the two different assumption, the assumption 
that data within individuals are independent and the 
assumption that data are dependence across time 
while maintain independence across individuals. 
There have been other growing studies on quantile 
regression for panel data, see e.g., Galvao(2011), 
Chetverikov, Larsen, and Palmer(2016), Gu and 
Volgushev(2019), Zhang, Jiang and Feng(2021). 


As the two-step estimation and minimum distance 
estimation fail when there are time-invariant 
independent variables in the model, we propose a 
two-stage instrumental variable quantile 
regression(2S-IVQR) estimation to estimate the 
time-invariant effects in panel data model. In the 
first stage, we introduce the dummy variables and 
perform quantile regression with all the data to 
estimate effects of individual covariates. Then in 
the second stage, we adapt instrument variables 
approach and 2SLS method. Moreover, we study 
the asymptotic properties of the proposed 
estimators. Monte Carlo simulation in various 
parameters sets proves the validity of the proposed 
approaches. We compare the bias and root mean 
squared error (RMSE) of the proposed estimators 
with the QR estimator's of Koenker and Bassett 
(1978) and the Grouped IVQR estimator's of 
Chetverikov, Larsen, and Palmer (2016). 


The rest of the paper is organized as follows. 
Section 2 presents two-stage instrumental variable 


quantile regression (2S-IVQR) estimator for panel 
data with time-invariant effects. Section 3 is 
devoted to the asymptotic behavior of the proposed 
estimators. Section 4 describes the Monte Carlo 
experiment. Finally, section 5 concludes the paper. 


2. Model and methods 
2.1. Basic Model 
Consider the panel data model that contains time- 
varying as well as time-invariant regressors: 
Vie =i + ZY + XitB + Find = 1,2,--, Nit 

= 1,2,-:-,T (1) 
where 
m=n+%, (2) 
XizIS ak X 1 vector of time-varying variables, and z; 
is an mX 1 vector of observed individual-specific 
variables that only vary over the cross-section units 
i. In addition to z;, the outcomes, y;, is also 
governed by unobserved individual specific effects, 
Ni: 
We consider the following model for the tth 


conditional quantile functions of the response of the 
tth observation on the ith individual y;,, 


Qy,,(TXits Zi, Mi) 

= i(t) + zjy(t) + xB O), (3) 
There has been a lot of research on how to estimate 
B(t), see Koenker(2004), Canay(2011), Galvao and 
Wang (2015), etc. Besides, the estimation method 
proposed by Canay(2011), Galvao and Wang 
(2015) can not identify y(t). Therefore, we are 
primarily interested in estimation y(t) 
inthispaper,that is,the focus of the following 
analysis is on estimation and inference involving 
the elements of y(t). 


2.2. 2S-IVQR estimator 

Chetverikov, Larsen, and Palmer(2016) are 
interested in estimating y(t), they propose the 
Grouped IVQR estimator. The Grouped IVQR 
estimator consists of two stages. First, estimate tth 
quantile regression of y,;,on x; using the data 
{(vie Xie): = 1,2,..,N}by the classical quantile 
regression(QR) estimator of Koenker and Bassett 
(1978). Second, estimate a 2SLS regression of 
intercept term coefficient on z;using instrument 
variablesto get an estimator f¢,ouped ivor(t) Of y(7). 


However, the Grouped ITVQR estimator calculates 
each individual data separately, that is, it needs to 
calculate n times in the first stage. To simplify 
computation and improve the estimation accuracy, 
we propose a two-stage instrumental variable 
quantile regression estimator(2S-IVQR) for model 


(3). 
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The 2S-IVQR estimator can be obtained via the 
following two stages. 


Stage 1 : For each individual i and each quantile rt, 
estimates tth quantile regression of y;,on x;, and J; 
using the data = {(yie,%iz,0j):i = 1,2,-,Nit = 
1,2,-:-,T }by the QR estimator: 

N T 
(B@,9@) = argmin YY pubic - HBO 

i=1 t=1 

— €8,(T) — ++ — eB (T)) (4) 

where 
B(T)=Ni (tT) + zy (T), (1) = (01 (7), (Oe: = 


1 belong to individual i = - 
fe sree PECs) = u(t — I(u < 0)) 
is the check function. 


Stage 2: Estimate a 2SLS regression of 3,(t) on 
z,;using w;as an instrument to get an estimator/(t) 
of y(t): 


9(2) ; 
= (Z'PyZ)~*(Z'Py9(z)), () 
where Z=(2,°":,Zy)',W =(wy,°",Wy)’, and 


Py =W(W'wW)-1W". 

As we can see, for fixed t, the first stage only 
needcalculate one time by introducing the dummy 
variable. In addition, the second stage employs the 
two-stage least squares method to get/2s5_ jyor(T) 
andf25_ ;yar(t)is a weighted combination of z; and 
Wj- 


3. Asymptotic theory 

Now we briefly discuss the asymptotic properties of 
thef2>_ jvgr(t). To establish the asymptotic 
properties of the 2s_j;yor(t), we impose the 
following regularity assumptions. 


Assumption 1: Observations are independent across 
individuals, and for all i = 1,---, N, (Xi, Viz) are 1.i.d. 
across t = 1,::-,T. 


Assumption 2: The set of quantile indices U is a 
compact set included in (0,1). 


Assumption3:(1) For all te€U and i=1,-,N, 
E|(wiy(z))| = 0.Gi) AsN > 0, N~1 YM, Elzjw'i] > 
Qzw andN~* iL E[Ww' i] > Qww where Qzw and 
Qww are matrices with singular values bounded in 
absolute value from above by Cyand from below by 
cy. (iii) For all i=1,-,N and t =1,-:,T,y,is 
independent ofw,conditional on(x;jz,z;). (iv) For all 
i=1,--,N, E[||wil|*7™] < Cy. 


Assumption 4: As N > 00,N2/3(logT)/T > 0. 


Assumption5: For all i=1,---,N, z;satisfy the 
moment conditions||Z;|| < Cy, ||xiel| < Cy- 


Assumption 6: For 1t,,t,€U and i=1,-,N, 

ll y(t2) — Yt) IIS C, Il T2 — Ty Il. 

Assumption 7:() For all i=1,:°,N, 

E [sup|neceiem| < Cy. (ii)ASN > ©, for t,,T2 € U 
if 


N71, Eli (ta) nit) wiw' i] > J (ty, T2). ii) For 
T,T2 EU, Il nz) — H(t) IS C, Wt. — 7 I. 


Assumption 8:(1) Let f;(-) denote the conditional 
density function of uj = vig — 3 — Zi — XieB given 
(xiZ;),for all teEU and i=1,---,N,f;(-) is 
continuously differentiable, f{(-) < Cy and f,(0) = 
cr. (li) For i=1,---,N,the derivative f/(-) satisfy 
Ifi (U)| < C. 


Theorem 1 Under Assumptions 1-8, 


VN (P25- vor() -y¥()) > GO), 


where G(-) is a zero-mean Gaussian process with 
uniformly continuous sample paths and covariance 
function C(t1,T2) = SJ(t4, T2)S", J (t4, T2)is defined in 
Assumption 7, S= (Qew QuwQ' zw) * Qzw Qww where 
Qzwand Q,,yare defined in Assumption 3. 


Remark 1: Under Assumptions 1-8, 


VN (Pos- IVQR (t) -y(@)) > N(O,V), 
where V = SJ(t,T)S’. 


4. Monte Carlo 
The samples are generated from the following 
model: 


Vg 4 
Vit = Xj +28 + €; + Viz, Z; = TW; + O; + Uj, E; 
= no; = k, 


wherev;;, 0; ~ U(0,1), Xit, wi, Uj; ~ Exp(0.25 * 
N(0,1)),@a =B =2,7=n=1,k=0.5. Notice that 
E(&) = E(e|w;) = nE(@;) —k =0.5—-0.5=0. For 
the sake of comparing the performance and 
efficiency between different methods, we compare 
the Bias and RMSE of the following estimators: 
fixed effects quantile regression (QR) and grouped 
IV quantile regression (Grouped IVQR) as in 
Chetverikov, Larsen, and Palmer(2016).Here we 
consider two different forms of generation of z;. 
The first case is z; being endogenous (correlated 
with e;through 6;). In the second case, z;is 
exogenous, where we set z; = w;.In the simulations, 
we report results 
considering 

{(N, T)| (25,25), (25,50), (100,25), (100,50)}, 
andquantilest = {0.1,0.25,0.5,0.75,0.9}. We set the 
number of replications to 1000. 





@ WTSRD | Unique Paper ID-UTSRD47716 | Volume—5 | Issue—6 | Sep-Oct 2021 


Page 1759 


International Journal of Trend in Scientific Research and Development @ www.ijtsrd.com eISSN: 2456-6470 


Table 1Bias and RMSE of estimators when z; is endogenous 





































































































0.1 | 0.25 0.5 0.75 0.9  Avg.abs. 

ae Bias | 0.3607 | 0.3909 | 0.3754 | 0.3929 | 0.4031 | 0.3846 

RMSE | 0.0241 | 0.0256 | 0.0247 | 0.0258 | 0.0267 | 0.0254 

Bias | -0.0636 | -0.1002 | -0.1022 | -0.1254 | -0.0464 | 0.0876 

N= 25,7= 25 | Grouped IVQR | RvisE | 0.0208 | 0.0399 | 0.0390 | 0.0557 | 0.0389 | 0.0388 
rs1voR _|_Bias_|-0.0662 |-0.0720 |-0.0711 |-0.1186 | 0.0032 | 0.0602 

RMSE | 0.0208 | 0.0256 | 0.0254 | 0.0599 | 0.0668 | 0.0397 

on Bias | 0.3570 | 0.3796 | 0.3864 | 0.4090 | 0.4172 | 0.3899 

RMSE | 0.0149 | 0.0158 | 0.0161 | 0.0169 | 0.0173 | 0.0162 

w=25,7=50 | Grouped 1VOR | Bits_| 0.0243 | -0.0468 | -0.0239 | -0.0337 | -0.0438 | 0.0345 
RMSE | 0.0111 | 0.0126 | 0.0130 | 0.0115 | 0.0111 | 0.0118 

rs-1voR _ |__Bias_| 0.0265 |-0.0526 | -0.0348 | -0.0282 | -0.0314 | 0.0347 

RMSE | 0.0078 | 0.0091 | 0.0090 | 0.0082 | 0.0080 | 0.0084 

x Bias | 0.3686 | 0.3772 | 0.3787 | 0.4033 | 0.4157 | 0.3887 

RMSE | 0.0244 | 0.0247 | 0.0247 | 0.0262 | 0.0272 | 0.0254 

n= 100,7-=25 | Grouped IVOR | Bia8_| -0.0670 | -0.0236 | -0.0762 |-0.0371 |-0.0772 | 0.0562 
RMSE | 0.0307 | 0.0504 | 0.0246 | 0.0711 | 0.0372 | 0.0428 

rsvor _ |Bias_|-0.0544 |-0.0316 |-0.0632 |-0.0321 | -0.0642 | 0.0491 

RMSE | 0.0268 | 0.0444 | 0.0199 | 0.0645 | 0.0333 | 0.0378 

a Bias | 0.3548 | 0.3838 | 0.3832 | 0.4029 | 0.4138 | 0.3877 

RMSE | 0.0182 | 0.0195 | 0.0194 | 0.0204 | 0.0211 | 0.0197 

w=100,7 =50 | Grouped 1VOR | Bi8s_| 0.0183 |-0.0451 | -0.0516 | -0.0166 | -0.0192 | 0.0301 
RMSE | 0.0105 | 0.0181 | 0.0149 | 0.0102 | 0.0100 | 0.0127 

rs-1voR  |_Bias_|-0.0166 | -0.0430 | -0.0537 |-0.0120 | -0.0244 | 0.0299 

RMSE | 0.0101 | 0.0179 | 0.0141 | 0.0090 | 0.0093 | 0.0121 








Table | provides the Bias and RMSE of the estimators in the case z; is endogenous. 
and RMSE of Grouped IVQR and 2S-IVQR estimators are better than QR estimators. 


It is clear that the Bias 


Moreover, the Bias and RMSE of 2S-IVQR estimator are smaller than those of Grouped IVQR estimator at 
most quantiles. Meanwhile, we also find that the Bias and RMSE decreases as N and T increases for 2S- 


IVQR estimators. 


Table 2 Bias and RMSE of estimators when z; is exogenous 



















































































: 0.25 0.5 0.75 0.9 Avg.abs. 
OR Bias | 0.0254 | 0.0292 | 0.0008 | 0.0131 | -0.0188 | 0.0174 
RMSE | 0.0139 | 0.0145 | 0.0127 | 0.0139 | 0.0136 | 0.0137 
w =25,¢=25 | Grouped IVOR Bias | -0.0085 | 0.0083 | 0.0274 | 0.0145 | 0.0012 | 0.0120 
RMSE | 0.0167 | 0.0194 | 0.0212 | 0.0192 | 0.0165 | 0.0186 
2S-IVOR Bias | 0.0036 | 0.0069 | -0.0047 | 0.0219 | 0.0027 | 0.0080 
RMSE | 0.0118 | 0.0136 | 0.0123 | 0.0124 | 0.0121 | 0.0124 
OR Bias | 0.0135 | -0.0023 | 0.0027 | -0.0078 | -0.0085 | 0.0070 
RMSE | 0.0098 | 0.0091 | 0.0096 | 0.0101 | 0.0096 | 0.0975 
w =25,¢=50 | Grouped IVOR Bias | 0.0037 | 0.0052 | 0.0033 | 0.0003 | -0.0072 | 0.0039 
RMSE | 0.0119 | 0.0137 | 0.0142 | 0.0120 | 0.0119 | 0.0127 
2S-IVOR Bias | 0.0069 | -0.0040 | 0.0037 | -0.0031 | -0.0029 | 0.0041 
RMSE | 0.0078 | 0.0086 | 0.0089 | 0.0083 | 0.0086 | 0.0084 
OR Bias | 0.0308 | -0.0140 | -0.0035 | -0.0161 | 0.0108 | 0.0150 
RMSE | 0.0141 | 0.0129 | 0.0128 | 0.0128 | 0.0137 | 0.0133 
w =100,6 = 25 | Grouped IVQR Bias | 0.0139 | -0.0286 | 0.0073 | -0.0123 | 0.0237 | 0.0172 
RMSE | 0.0130 | 0.0137 | 0.0145 | 0.0135 | 0.0137 | 0.0137 
2S-IVOR Bias | 0.0049 | -0.0166 | -0.0013 | -0.0052 | 0.0179 | 0.0092 
RMSE | 0.0121 | 0.0119 | 0.0127 | 0.0117 | 0.0125 | 0.0122 
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a Bias | -0.0032 | 0.0079 | -0.0039 | 0.0036 | -0.0123 | 0.0062 
RMSE | 0.0087 | 0.0089 | 0.0086 | 0.0094 | 0.0094 | 0.0090 

Bias | -0.0127 | 0.0042 | -0.0053 | 0.0006 | 0.0085 | 0.0062 

N = 100,6 = 50 | Grouped IVQR | PxisE | 0.0081 | 0.0089 | 0.0100 | 0.0097 | 0.0089 | 0.0091 
ds-1voR _|Bias_|-0.0080 | 0.0052 |-0.0025 | 0.0068 | 0.0029 | 0.0051 

; RMSE | 0.0078 | 0.0076 | 0.0084 | 0.0083 | 0.0082 | 0.0081 




















The Bias and RMSE of the estimators when z;isendogenous are shown in Table 2. The results are similar to 
those of Table 1. The results of Grouped IVQR and 2S-IVQR estimators are better than those of QR 
estimators and 2S-[VQR estimator performs better than Grouped IVQR in terms of Avg.abs.bias and Avg. 
RMSE. In addition, the Bias and RMSE decreases as N and T increases for 2S-IVQR estimators. 


5. Conclusion 

We explore a two-stage approach to instrumental 
variable quantile regression for panel data with 
time-invariant effects. In the first stage, the dummy 
variables are introduced to represent the time- 
invariant effects. By employing the dummy 
variables, the number of estimation parameters can 
be reduced. Then in the second stage, we apply 
instrument variables approach and 2SLS method. 
The proposed estimatoris a weighted combination 
of z; and w;.Moreover, the asymptotic properties of 
the proposed estimators are studied in Section 3. 
Monte Carlo simulation in various parameters sets 
proves the validity of the proposed approach. 
Monte Carlo simulation presents that with 
increasing sample size, the Bias and RMSE of our 
estimator are decreased. Besides, our estimator has 
lower Bias and RMSE than those of the QR 
estimator and the Grouped IVQR estimator. 
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7. Appendix 
Proof of Theorem 1Denote/(t) = (Z'PyZ)~1(Z'Pw9(t)), Qaw = 


ZIW Az _ ww 
N »Qww = N”’ 


VN (P2s— ver () - ¥(T)) = Qew Qinw ew) * Qew Qinw W'(8(z) — 9(t))/VN).(6) 
By Assumptions 3(iv) and 5 and Chebyshev’s inequality, we have 


then 





1 , eax 

< bini(Ziw' i — E[ziw'i}) 20. 

Observe that N~1 ¥*_, E[z;w';] > Q,yby Assumption 3, it suffices to prove 
i=1 y p p 

ee _ , P 

Qew =N ty Zw ie Qow- 


Similarly, it can be obtained 


Thus, 

P B.S woe. a ee ee = 

S= (0.0 Cau’ zw) “Onwlww _ OnCuwe woes) "Ori wal =S$ (7) 
According to Chetverikov, Larsen, and Palmer(2016), 

N 
1 : _ 
S(t) = Te (6:,(2) - 9,(0)) w's = 0, (1)(8) 

uniformly over t € U. 
Observe that(7) and(8)are satisfied, by (6)we have 
VN (f25- wor(t) -7(t)) = Op (1).(9) 
Next, we show VN(f(-) — y(-)) > GC). Note that 
VNC) YO) = $- EEL wimi(@). 


and $5 Sby (7). In addition, Lemma 3 of Chetverikov, Larsen, and Palmer(2016) implies that 


N 
1 
FD win = FO 
i=1 


where G®°(-) is a zero-mean Gaussian process with uniformly continuous sample paths and covariance 
function J (Tt, T2),J (tT, T2)1s defined in Assumption 7. Therefore, by Slutsky’s theorem, 


VNC) -7O) > GC), 
where G(-) is a zero-mean Gaussian process with uniformly continuous sample paths and covariance 


function C(t,,T2) = SJ(t,,T2)S'.Combining (9) gives the asserted claim and completes the proof of 
Theorem 1. 
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